# Italian speech recognition

Whisper Large V3 Distil It V0.2
MIT
A 2-layer decoder distilled Whisper speech-to-text model optimized for Italian, improving efficiency while maintaining accuracy
Speech Recognition Transformers Other
W
bofenghuang
129
1
Exp W2v2t It Vp Fr S821
Apache-2.0
An Italian automatic speech recognition model fine-tuned from facebook/wav2vec2-large-fr-voxpopuli, trained using the Common Voice 7.0 Italian dataset
Speech Recognition Transformers Other
E
jonatasgrosman
27
0
Exp W2v2t It Wavlm S895
Apache-2.0
An Italian automatic speech recognition model fine-tuned based on microsoft/wavlm-large, trained using the Common Voice 7.0 Italian dataset.
Speech Recognition Transformers Other
E
jonatasgrosman
42
0
Exp W2v2t It No Pretraining S842
Apache-2.0
Fine-tuned from a randomly initialized wav2vec2 model for Italian speech recognition tasks, trained on the training split of Common Voice 7.0 (Italian).
Speech Recognition Transformers Other
E
jonatasgrosman
18
0
Exp W2v2t It Xlsr 53 S387
Apache-2.0
An Italian automatic speech recognition model fine-tuned based on the facebook/wav2vec2-large-xlsr-53 model, trained using the Common Voice 7.0 Italian dataset.
Speech Recognition Transformers Other
E
jonatasgrosman
18
0
Exp W2v2t It Vp 100k S449
Apache-2.0
An Italian automatic speech recognition model fine-tuned from the facebook/wav2vec2-large-100k-voxpopuli model, trained using the Common Voice 7.0 Italian dataset.
Speech Recognition Transformers Other
E
jonatasgrosman
17
0
Exp W2v2t It Wav2vec2 S609
Apache-2.0
An Italian automatic speech recognition model fine-tuned based on facebook/wav2vec2-large-lv60, trained using the Common Voice 7.0 Italian dataset.
Speech Recognition Transformers Other
E
jonatasgrosman
18
0
Wav2vec2 Xls R 1b Italian Doc4lm 5gram
Apache-2.0
Italian speech recognition model fine-tuned from XLS-R 1B parameter model, supports recognition with language model
Speech Recognition Transformers Other
W
radiogroup-crits
19
1
Wav2vec2 Xls R 1b Italian Robust
Apache-2.0
An Italian automatic speech recognition model fine-tuned on Common Voice 7 and Libri Speech datasets based on facebook/wav2vec2-xls-r-1b
Speech Recognition Transformers Other
W
dbdmg
130
0
Wav2vec2 Xls R 1b Italian
Apache-2.0
This is an Italian automatic speech recognition model based on the XLS-R 1B architecture, fine-tuned on multiple Italian datasets
Speech Recognition Transformers Other
W
jonatasgrosman
2,703
1
Xls R 300m It Phoneme
Speech recognition model fine-tuned on Italian dataset based on facebook/wav2vec2-xls-r-300m
Speech Recognition Transformers
X
patrickvonplaten
17
1
Wav2vec2 Large It Voxpopuli
A speech recognition model pre-trained on unlabeled Italian data from VoxPopuli, using Facebook's Wav2Vec2 architecture
Speech Recognition Other
W
facebook
55
0
Wav2vec2 Base It Voxpopuli
Wav2Vec2 base model pretrained on unlabeled Italian data from VoxPopuli, suitable for speech recognition tasks.
Speech Recognition Transformers Other
W
facebook
32
0
Wav2vec2 Large Xlsr Italian
Apache-2.0
An Italian speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, achieving a word error rate of 13.91% on the Common Voice Italian test set
Speech Recognition Other
W
joaoalvarenga
27
2
Wav2vec2 Large Xlsr 53 Italian
Apache-2.0
An Italian automatic speech recognition model fine-tuned from the facebook/wav2vec2-large-xlsr-53 model, trained on the Common Voice 6.1 dataset
Speech Recognition Other
W
jonatasgrosman
1,012
13
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase